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Abstract 

The present paper is devoted to estimating the speed of convergence towards consensus for a 
general class of discrete-time multi-agent systems. In the systems considered here, both the topology 
of the interconnection graph and the weight of the arcs are allowed to vary as a function of time. 
Under the hypothesis that some spanning tree structure is preserved along time, and that some 
nonzero minimal weight of the information transfer along this tree is guaranteed, an estimate of the 
contraction rate is given. The latter is expressed explicitly as the spectral radius of some matrix 
depending upon the tree depth and the lower bounds on the weights. 

Keywords: multiagent systems; distributed consensus; convergence rate; linear time-varying sys- 
tems; uncertain systems; stochastic matrices; Perron-Frobenius theory; mixing rates. 

1 Introduction 

Appeared in the areas of communication networks, control theory and parallel computation, the analytical 
study of ways for reaching consensus in a population of agents is a problem of broad interest in many 
fields of science and technology; see [2] for references. Of particular interest is the question of estimating 
how quickly consensus is reached on the basis of few qualitative (mainly topological) information as well 
as basic quantitative information on the network (mainly the strength of reciprocal influences). 

Originally, this problem was considered in the context of stationary networks. For Markov chains that 
are homogeneous (that is stationary in the vocabulary of dynamical systems), it amounts to quantify 
the speed at which steady-state probability distribution is achieved, and is therefore directly related to 
finding an a priori estimate to the second largest eigenvalue of a stochastic matrix. Classical works on 
this subject are due to Cheeger and Diaconis, [HIS], see also [TT] for improved bounds, as well as [TOlfT^ 
and [18| for a survey. The latter concern reversible Markov chains, for example when the transition 
matrix is symmetric, see e.g. [5] for the non-reversible case. 

Among the classical contributions which instead deal with time- varying interactions we refer to the 
work of Cohn, [7], where asymptotic convergence is proved, but neglecting the issue of relating topology 
and guaranteed convergence rates. Tsitsiklis et al. also provided important qualitative contributions to 
this subject [2D1[1T1[S], as well as Moreau [T3]. See also [T] for further nonlinear results. In particular, 
the role of connectivity of the communication graph in the convergence of consensus and spanning trees 
has been recognised and finely analysed [HI O [TB] . 

More recently, important contributions in characterizing convergence to consensus in a time-varying 
set-up were proven by several authors, see for instance [31 [TS]. See also [SJEj for more specific cases. 

In a previous paper [^ , several criteria were provided to estimate quantitatively the contraction rate 
of a set of agents towards consensus, in a discrete time framework. The attempt there consisted in 
following the spread of the information over the agent population, along one or more spanning-trees. 
Ensuring a lower bound to the matrix entries of the agents already attained by the information fiow 
along the spanning-tree, rather than the nonzero contributions as classically, permitted to obtain tighter 
estimates with weaker assumptions. Distinguishing between different sub-populations, of agents already 
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touched by spanning-tree and agents not yet attained, and using lower bounds on the influence of the 
former ones, one is able to establish rather precise convergence estimates. 

As a matter of fact, rapid consensus can be obtained in two quite different ways — either by dense 
and isotropic communications (based, say, on a complete graph), or by very unsymmetric and sparse 
relations (with a star-shaped graph with a leading root) . In the first case many spanning trees cover the 
graph, while in the second configuration a unique one does the job. 

The present article is a continuation of [2] . Emphasis is put on propagation of a unique spanning tree 
and on the resulting consequences in terms of convergence speed. It is demonstrated that in the particular 
case where such a spanning tree structure is guaranteed to exist at any time, ensuring minimal weight to 
the transmission of information along the tree (from the root to the leafs) indeed enforces some minimal 
convergence rate, whose expression is particularly simple. A worst-case estimate is provided, expressed 
as the spectral radius of certain matrix whose size equals the depth of the tree and whose coefficients 
depend in a simple way of the assumed minimal weights. This results in a sensible improvement over 
existing evaluations. 

The paper is organized as follows. Section [2] contains the problem formulation and a presentation of 
the main result, together with the minimal amount of technical tools to allow for its comprehension. A 
comparison system is introduced afterwards in Section [31 whose study is central to establish the conver- 
gence estimate. The original method for analysis of this system is used in Section |3 to get convergence 
rate estimate (therein is stated the main result of the paper. Theorem [3]), and some properties of the 
latter are studied. This result is commented in Section [5l before some concluding remarks. The proofs 
are sent back to Appendix. 



Notations 

The i-th vector of the canonical basis in the space R" (1 < i < n) is denoted e"; the vector with all 
components equal to 1 in R" is written 1". When the context is clear, we omit the exponent and just 
write Ci, resp. 1 to facilitate reading. We also use brackets to select components of vectors. All these 
notation are standard, and for a vector x e R", the z-th component is written alternatively Xi, [x]i, 
{efYx or ejx. 

The systems considered here will be composed of n agents: accordingly, we let Af ^ {1, . . . ,n}. 

As usual, identity and zero square matrices of dimension qx q are denoted Iq and 0^ respectively. We 
denote Jq the q x q matrix with ones on the sub-diagonal and zeros otherwise: {Jq)i.j = Si^j^i. Here, 
and later in the text, S denotes the Kronecker symbol, equal to 1 (resp. 0) when the condition written in 
the subscript is fulfilled (resp. is not). For self-containedness, recall that a real square matrix M is said 
stochastic (row-stochastic) if it is nonnegative with each row sum equal to 1. 

The spectral radius of a square matrix M is denoted Ainax(-^^)- Last, we use the notion of nonnegative 
matrices, meaning real matrices which are componentwise nonnegative. Accordingly, the order relations 
< and > envisioned for matrices are meant componentwise. 



2 Problem formulation and presentation of the main result 

Our aim is to estimate the speed of convergence towards consensus for the following class of time- varying 
linear systems: 

x{t + 1) = A{t)x{t) (1) 

where A{t) = {ai,j{t))(ij) G R"^" is a sequence of stochastic matrices (in particular, A{t)l = 1; this is 
exactly the dual of what happens in the case of non-homogeneous Markov chains, where the probability 
distribution, written as a row vector w{t), verifies rather a relation like Tr{t + 1) = iT{t)A{t)). 

Let us first introduce some technical vocabulary to present in simplest terms the main result of the 
paper, afterwards enunciated in Section 3) The definition of the quantity we intend to estimate is as 
follows. 

Definition 1 (Contraction rate). We call contraction rate of system ([T|) the number p G [0, 1] defined 
as: 

(max Xj (t) — min (t) 
maxxi(O) - mmx^O) 
ieM is A; 
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where the supremum is taken on those x{0) for which the denominator is nonzero. 

The contraction rate is thus related to the speed of convergence to zero of the agent set diameter. In 
what follows, the latter plays the role of a Lyapunov function to study convergence to agreement. For 
stationary systems, as is well known, the number p is indeed the second largest eigenvalue of the matrix 
A. More in general, it corresponds to the second largest Lyapunov exponent of the considered sequence 
of matrices A{t). 

Definition 2 (Communication graph). We call communication graph of system ((T|) at time t the directed 
graph defined by the ordered pairs {j, j) G A/" x A/" such that ai,j(t) > 0. 

In the present context, we use indifferently the terms "node" or "agent". 

We now introduce assumptions on the existence of a constant hierarchical structure embedded in the 
communication graph, and on minimal weights attached to the corresponding links. 

Assumption 1. For a given positive integer Td > 0, called the depth of the communication graph, 
assume the existence of nested sets Ao , . • . , A/t^ such that 

• Aq is a singleton ( whose element is called the root ); 

• JVk cMk+i; 

. AAt, =AA={l,...,n}. 

Assume in addition, for given nonnegative real numbers a, (3, 7, that, for all t > and all k G 
{1,2,. ..,T4 

ai^,{t)>a ifieJVo, (2a) 
J2 a,,,{t)>P ifteMkWk-i, (2b) 

a,j(t)>7 if I e Afk\Afk-i- (2c) 

As an example, the sets Afc may be induced by some fixed spanning tree embedded in the communi- 
cation graph: the existence of a distinguished agent, the root, is presupposed and, although the matrices 
A(t) and the underlying communication graphs are allowed some variations, information progress from 
this root along a (time-varying) tree to attain all the agents. The number bounds from above the 
minimal time for the information to attain the most distant agents from the root. Likely, we call 
di = mm{k : i E Afk} the depth of agent i. The set Afc indeed consists of all the agents i whose depth 
di is guaranteed by Assumption [T] to be at most equal to k. An example of (fixed) communication graph 
and the associated nested sets is shown in Figure [TJ 

In addition to the spanning tree structure. Assumption [T] imposes some minimal weights to the 
information transmitted downstream along this structure (this is the role played by 7), and also to the 
information used between agents located at same depth. Concerning the latter, expressed by condition 
(|2b[) . remark that it is fulfilled by self-loops, that is when 

a,4t)>f3 iffi, >0 

(because by definition, i G Afd^ \ J^di-i for di > 0); but it is indeed weaker: it allows just as well 
communications between agents whose depths are equal. The constraint on the self-loops of the root 
agent, measured by a, is different than for the other agents (/3); this is done on purpose, and permits 
to treat simultaneously the case of leaderless coordination and 'pure' coordination with a leader (case 
corresponding to a = 1). 

Last, notice that, the matrices A{t) being stochastic, one should have: 

a, P + 1 <1 

for Assumption [1] to be fulfilled. 
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Figure 1: The nested sets and the spanning tree 



We are now in position to present the contents of Theorem [31 The latter states that, under the 
conditions exposed above, the rate of convergence of system ([T]) is at most equal to the spectral radius of 
the Td X Td matrix C,Td{oi, defined by 
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with a* = minjo!, 7}, /?* = min{/3 + 7, a} — a* 



A major characteristic of this estimate is that it is independent of the number n of agents: it only 
depends upon a, (3, 7 and the depth Td. 

We introduce, in the rest of the present Section, a general example where Assumption [T] is naturally 
fulfilled. 

Definition 3 ((a, (3, 7)-tree matrix). For every nonnegative numbers a, /3, 7, we call (a, /3, 7)-tree matrix 
any matrix M G M"^" defined by the recursion formula 



Ml = [a] 



M, 







7u'~-'- 





M = A/„ 



where, for all i 



n, the vector is a vector of the canonical basis in . 



Notice that the agents have implicitly been numbered by the tree matrix representation: the pertinent 
information propagates from smaller to higher indexes. A central case where Assumption [1] holds is given 
by the following result. 

Proposition 1. Let A{t) be stochastic matrices. Assume the existence of a sequence of (a, [3, j) -tree 
matrices M{t) = (?7ijj(t))(jj) G R"^" such that for all t £ N 

A{t) > M{t) . 

Then, after some finite time, system ((T|) fulfills Assumptions^ with 

Nk = {i£N : r, < k} 
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Figure 2: Trees of depth 2 inducing a nested structure of depth 3. 



and Td = r„ recursively defined as 

ri = 0, ri = 1 + maxjrj : j < i and mi_j{t) > infinitely many times} , 2 < i < n . 
We emphasize the fact that the lower bound M(t) may vary upon time. 

By construction, in Proposition!!] verifies: 1 < Id < n — 1, and does not depend upon the ordering 
of the matrices M{t). Moreover, it may be proved directly that in the particular case of constant M, Td 
is the depth of the associated graph (defined in Definition [5]) ; generally speaking, however, the depth of 
a tree matrix sequence is at least equal to the limsup of the depths of the individual matrices M(t). To 
prove both properties, it is sufficient to remark that, in the case of constant M , the previous formula 
indeed computes the depth of the associated graph. Figure [2] presents the case of two matrices for which 
the suprcmum of depths is equal to 2, that is strictly less than the depth of the sequence of matrices 
obtained by alternatively taking each of them, which is here equal to 3 (and also strictly less, in this 
case, than n — 1 = 4). One can take the numbers di defined in Assumption [T] equal to the corresponding 
numbers given below, a quite natural choice which yields in the present case: 

di = 0, fi2 = Ij c?3 = 1 + max{(ii, ^2} = 1 + max{0, f } = 2, ^4 = ^5 = 3 . 

Proof of Proposition^ Let the family of sets Nk be defined as in the statement. Clearly Ao = {1} is 
a singleton, and also mi^i(<) > a for all t > as desired. Moreover, J\fk\Afk-i = {i E Af : r^ = k}. 
In particular then, i E AfrA-^ri-i for all i > 1. Since mi,i{t) > (3 for all i > 1 and all t > 0, it is 
straightforward to verify that 

mjj(t) > mj,j(t) > /3 

as desired, for Assumption 1 to hold. Finally, let i G J\fk\Mk-i , viz. = k. This yields mi,j{t) > 
for some j g A4-i for infinitely many times. Actually, more is true due to the special structure of 
tree matrices, namely m^.j (t) > 7 for infinitely many ts. We claim that for all t larger than some finite 
time Ti there exists j{t) G Afk-i such that mij(t){t) > 7. Indeed, let Ft{i) G {l,2,...j— 1} denote 
the father of the i-th node in the tree matrix M{t), that is the unique index j such that Mi_j{t) > 0; 
clearly, mi_p^^i^{t) > 7 for all t. Indeed, for all sufficiently large is, Ft{i) S Mk-i, (otherwise Ft{i) ^ 
Mk-i infinitely many times and, therefore, we would have > fc + I, which is a contradiction). Let 
Td = voAyii^jsfTi. For all subsequent times, we have: 

^ m,;j(i) > m,,_F^(,)(i) = 7. 
This concludes the proof of the Proposition. □ 
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3 A comparison system for the diameters evolution 



Wc now build an auxiliary time- varying system, with a simpler structure than ([I]), and with the property 
that the asymptotic contraction rate of the original system can be bounded from above by carrying out 
suitable computations on this newly introduced system. Our main result for the present section is a 
statement relating convergence of ([T]) towards consensus of a comparison system introduced below. 

Theorem 2. Assume system ([T]) fulfills Assumption]^ for given nonnegative numbers a,(3,j (such that 
a, /? + 7 < 1). Let A(t) be defined by 



A{t) = 



/ maxxi(t) — minx,(t) \ 

maxxi(t) — min 
,:gM ^ ' ,:gM ^ ' 



max Xi(t) ~ min Xi(t) , 
Then, A(t) satisfies the following inequality: 



A{t + 1) < 







Or, 



CT.(a*,/3*) 



where Cr^a* , 13*) £ 



Ct, (a^ n = il-a*- P*) \e\ + /3*/t, + «Vt, 
a* minjo!, 7}, /3* = min{/3 + 7, a} — a* . 



(3) 



(4) 
(5) 



Recall that inequality Q is meant componentwise. A complete proof of Theorem [5] is provided in 
Section EH 

Remark 1. Two special cases of interest as far as application of Theorem[5]are obtained for the following 
values of parameters: 

1. a = 1: viz. communication graph admits a leader^ under such premises, expressions for a* and /3* 
simplify as follows: 

a* = 7 

2. a = /3, viz. root agent is not different from any other member of the group in terms of self-confidence 
on his own position in the formation of consensus: 



a* = min{/3, 7} /3* = max{0, /? - 7} 



4 Convergence rate estimate and properties 

Based on Theorem [2 we now provide Theorem [31 which states properly the property announced in the 
beginning of the paper. 

Theorem 3. Consider the linear time-varying dynamical system ([T]), with A{t) stochastic. Assume 
Assumption]^ is fulfilled. Then, the contraction rate towards consensus can be bounded according to the 
following formula: 

P < PTAa*,n = A„,ax(CT,(a*,r)) , (6) 
with (I'T'^ja*,/?* given in dH) and ©. 

Proof of Theorem [3] is given in Section E21 Recall that stochasticity of A{t) implies that the non- 
negative scalar a,/3,7 verify: a < 1, P + j < 1. 

Theorem [3] provides a tight estimate for the contraction rate of ([T]) on the basis of the parameters 
a, (3 and 7, and of the depth Td of the sequence of tree matrices. We emphasize the fact that the 
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result holds for time- varying systems. Indeed, Theorem [3] is an inherently robust result, as Assumption 
[T] allows for much uncertainty in the definition of system ([T|) . This robustness is meant with respect 
to variations of the communication graph (provided these variations don't violate the set conditions 
of Assumption [1]), and with respect to variations of the coefficients of the matrix A{t) (provided they 
respect the quantitative constraints in Assumption [ij . 

A central fact is that the value in ^ does not depend upon the number of agents involved in the 
network: rather the depth of the graph is involved, which is quite natural. 

Some properties of the estimate are now given. They are indeed useful to have a grasp on the 
asymptotic behaviour of the contraction estimate, as well as on their monotonicity properties; the latter 
are in agreement with the increase of decrease of information available by varying the parameters a, (3 
and 7. 

Theorem 4. Let a* ,13* G (0, 1]. Then for any T G N, pT{a*,(3*) = ^irLaxiCrict* , (3*)) has the following 
properties. 

• pxia* , (3*) is the largest real root of the polynomial equation 

-/3*\'^ fs-l3*\'^'^ s-f3* fl-[3*\ffs-l3*\'^'^ s-f3* \ 



1 

a' J \ oi' J \ a 



• For any T e N, pT{a*,l3*) < pT+i{a* , (3*). 

• For any T eN, 1 < pT(a*,/3*) < 1. 

• pT{a\(3*) < a*+r if and only if T < ■ 

• pria* , (3*) — > 1 when T +00, and more precisely 



(7) 



1- f3* J 13 



Theorem m is demonstrated in Section [A. 31 

The following result, demonstrated in Section lA. 41 studies the variation of px as a function of a, f3, 7. 
When considering px as a function of these quantities, we write pt{ol, (3,^), meaning pT{a* , (3*) for 
a* (a, f3, 7), (3* {a, (3, 7) defined as in dH). 

Theorem 5. For any T G N, 

• the function {a*,f3*) t-^ pT{a*,j3*) is nonincreasing on the set {{a*, (3*) G [0,1]^ : a* + (3* < 1}; 

• the function {a, (3,^) 1— > pt(pi, (3,^) is nonincreasing on the set {(a, /3, 7) G [0, 1]"^ ; /? + 7 < 1}; 

• Z//3 + 7 = + 7'; then pT{a,f3,j) < pT(a,/3',7') when (3 > (3' . 
Moreover, for any T G N, 

• Pt{<^i Pil) ~ ^ if o,i^d only if a ~ or 7 = 0. 

• pripi, (3,^) ^ (3 ~ \ — ^ if and only if a = P + j = 1. 

Notice that the estimates given in the last two points of Theorem [5] are tight: they are reached for 
the following stationary systems: 



Case a = 0: A 
Case 7 = 0: A 
Case a = f3 + J = 1: A 



= (3In + {1 - P){Jn + eiel) 
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Figure 3: Ratios between spectral gaps 

5 Discussion and interpretation of the results 

It is interesting to compare our results with the classical estimate p < ^ifX^^oi^ which is obtained by 
assuming a lower-bound a on the diagonal entries as well as on the non-zero entries of A(i). In our 
set-up this is obtained by letting a = j = (3 = a* and f3* = 0. In order to have an idea on the quality 
of the two estimates, we plot the ratio of the spectral gaps, 

l-pr,K,0) 
1_ Vi _ 

for Td = 2,3,4 in Fig. [31 As it is possible to see, the new estimates are consistently tighter than the 
classic ones; in the best case, viz. for a* « 0, the ratio of spectral gaps approaches Td- So, the quahty 
of the estimates actually improves with respect to the classic bound, as the horizon Td increases. 

When additional information is available, for instance when the coefRcient a,f3,j as given in ^ are 
known, then contraction rate estimates become much tighter with respect to their classical counterparts 
which are not able to discriminate between inner loops of the root node and inner-loops of individual 
agents, as well as strength of inter-agent communication links. In order to carry out a comparison, notice 
that under the assumption of a prescribed a, /?, 7 tree matrix bounding from below A(t), we may assume 
for the classical estimate the following value of a := min{a,/3,7} which indeed is always smaller than 
a* = min{Q;,7}. Hence, the corresponding spectral gaps satisfy: 

1 - 1 - min{a, /3, 7}^'' < 1 - 1 - min{a,7}^<i 
so that, we may compare the classical estimate with the new one by considering the following ratios: 

l-p^.(a^r) l-pT,K,/3*) 
1 - yi-min{a,/3,7f'' " 1 " Vl - a^^" 

We plotted the function at the right-hand side of the previous inequality in a logj^Q scale as a function 
of a* and /?*. In general the ratio depends critically on the tree depth Td, hence we only plot it for 
relatively small tree depths. In particular the results shown in Fig. |4] were obtained. Notice that the 
relative quality of the estimates again increases with Td, and already for T^; = 4 a significant portion of 
parameters space lies in the area in which estimates differ by a 10** factor. The dependence of pT^ upon 
a* and /3* is shown in Fig. [5]for T = 2, 3, 4. This also clearly shows the different monotonicity properties 
highlighted in the previous Section. 




9 



Figure 5: The function [a* , (3*) for Td ~ 2,3,4 (from bottom to top) 



6 Conclusion 

We provide a novel and tight estimate of the contraction rate of infinite products of stochastic matri- 
ces, under the assumption of prescribed lower bounds on the influence between different sets of agents 
which naturally arise by following the information spread along the interaction graph. This improves 
previously known bounds and, when additional information is assumed, exploits the additional structure 
for tightening of several orders of magnitude the previously available estimates. The other crucial factor 
in determining the overall convergence rate is the time Td needed to the information to propagate from 
some root node (which may or may not play the role of a leader) to the other nodes. The bound can be 
computed as the Perron-Frobenius eigenvalue (the spectral radius) of a positive T^-dimensional matrix, 
whose entries depend in a relatively simple way on the parameters characterizing the hypothetic lower 
bounds available. Some monotonicity and asymptotic properties of the bound are also proved. 

A Proofs 

A.l Proof of Theorem [2] 

For each given solution x{t) of H]) wc define two vectors of size T^; + 1 as follows: 



z{t) 



/ maxa;i(t) \ 
max Xi (t) 



\ max xAt) I 



wit) 



it)\ 



/ min X. 
min Xj (t) 



(8) 



in such a way that 



Zk(t) = max Xi(t), Wfc(i) = min Xi{t) 



Notice that A(t) defined in Theorem [5] equals z{t) — w{t), and that, for any i G J\f, WTa-i-i{t) < Xi{t) < 
Our first aim is to compute an upper-bound oi z{t + 1) on the basis of z{t). Let us consider the 
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following estimates. First, for the unique index zq G Ao, 

< aio,jo(i)x,o(t) + ^ajo,i(t) ZT,+i(i) ai„,^„ (t) (t) + (1 - a.^^^o (t)) ZT,+i(t) 

= azi(t) + (1 -a)zT,+i(t) , (9) 
using the fact that aio,io{t) > « > and zi{t) — Xig{t) < Also, for i ^ Afo (that is di > 0): 

x^{t + ^) = ^a,,,(t)a;,(t) = ^ a,j{t)xi{t) + ^ aij(i)a;,(i) + ^ a,,,(i)x,(t) 

< J2 au{t)xi{t)+ J2 <^^At)Mt) + I J2 ""Td+iit) 

= 0;,i{t){xi{t) - ZT^+i{t)) + a^j{t){xi{t) - ZT^+i{t)) + ZT^+i{t) 

< Y au{t){zd,+iit) - ZT,+i{t)) + Y «MW(^d.(i)-2T,+iW) + ^T,+i(i) 

< (3 Zd,+i{t) + J ZdM + - P - l) ZTd+i{t) (10) 

where the last inequality follows considering that Zdi{t) < Zdi+i{t) < ZTd+i(i) and /3 < J2ieAfa \Afd -i '^«.'(^)' 
7 < SisA/'d^-i "^vK*): by Assumptionffl 

We now proceed to compute suitable estimates for the vector z defined in ([8]). In particular, for any 
k < Td, we may derive, exploiting Q and (fTO|) : 



maxxj(i + 1) < max{azi(i) + (1 - a)zT<j+i(t), Pzk+i{t) +jzk{t) + (1 - /3 - 7) ZTa+iit)} ■ 

One has zi{t) < Zk{t) < Zk+i{t) < ZTd+i(^)- Hence, the previous inequality implies: 

Zk+i{t + 1) < max{azfc(<) + (1 - a)zT,+i(<), /3zk+i{t) + iZk{t) + (1 - /3 - 7) zr^+iit)} 
and thus 

Zk+i{t + 1) < min {/?, max{Q; - 7,0}} Zk+i{t) + min{Q;,7} Zfc(t) 

+ (1 — min{a, 7} — min{/3, max{a — 7, 0}}) ZT^+i{t) , 

that is: 

Zk+i{t + 1) < /3* zu+i{t) + a* Zfe(t) + (1 - a* - (3*) ZT,+i{t) (11) 

with the nomenclature adopted in ([5]). 

A symmetric argument can be carried out for the minima w defined in ([S]). In analogy to the formulas 
jH]), Uni) obtained in the previous paragraphs, we get: 

Xio{t + l) = ajg^i {t)xi (t) ^ a,p {t)x^„ (t) + a,p j {t)xi (t) 

> + WTd + l{t) ^ a^^^^^{t) X^^{t) + {1 ~ a,^^,^{t)) WTd + l{t) 

= flio.ioW (a^JoW -''«Td+iW) +''«Td+i(i) > awi(t) + (1 -a)w;Trf+i(i) , 
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and, for i ^ Afo, 

> ^ aij{t)xi{t)+ ^ aij{t)xi{t) + \ ^ aij(i) WTrf+i(0 

> Pwd,+i{t)+jWdM + i^-P-l)wTd+i{t) ■ 
Similarly to (fTT|) . we get 

+ > r Wk+i{t)+a* Wk{t) + {I ~ a* ~ P*) WT,+i{t) ■ 
Putting now together (fTTj) and (jA.ip leads to: 

Afc+i(t + l)<r Afc+i(t)+ a* Afe(t) + ATd+i(t) • (12) 
On the other hand, one also have 

Ao(t + l) = = Ao(i) . 
Gathering this inequalities yields ^ and proves Theorem [2] 

Remark 2. Notice that alternatively to (fTTj) . the following estimate is also valid: 

maxa;i(i + l) < max {azi(t) + (1 - a)zTd+i(t), f3zk+iit) + jzk{t) + {1 - f3 ~ ZT^+i{t)} 

< min{a,/3} + min {7, max{Q; - /3, 0}} Zfe(i) 

+ (1 — minjo!,/?} — min{7,max{Q; — 0}}) ZT^_|-i(t) . 

This yields the result of Theorem [2] with a* = min{7,max{a — /?, 0}}, /?* = min{a,/3} instead of 
but the sequel demonstrates that the corresponding estimates are less precise (see Remark [3] below). □ 

A.2 Proof of Theorem M 

All the factors in © being nonnegative, the order relation is compatible with multiplication. One then 
obtains, for all t G N, 



1 



A(t) < I a* 

0(T<i-l)xl 



OTd 



t 



xl 



CTd(a*,r) 



A(0) < a* 



0(Td-l)xl 



OTdXl 



t 



CTd 



(,.,^.) (iTd)A..,.(0) 



where the fact that Ai(t) = < Afc(t) < Ak+i{t) < AT^+i{t), 1 < fc < + 1, have been taken into 
account. 

One deduces that 

'^'"^ ^"^-«'»> 

and 

ATd+i(t) = e^^;V^'A(t) < e?f CTd(a^r)*l'^''ATd+l(0) , 

from which it ensues 

ATd + l(0\'^* TdT,„ «*^HTd^l/* 



<limsup||CTd(a^/3*)*|| ' = A„,ax(CTd (a*(A), ^ (A))) 

t — » + C« 
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see e.g. [TU Corollary 5.6.14] for a demonstration of the previous equality. This yields ^ and achieves 
the proof of Theorem [3] 

Remark 3. Recall that by virtue of Remark [51 one is allowed to choose a* and f3* according to (O or 
to 

P* = min{Q;, /3}, a* = min{/3 + j,a} - (3* . 

Besides, due to linearity of C{t) with respect to a* and /?*, any convex combination of the two formulas 
can still be adopted as a suitable value of {a*, (3*) for the comparison system. In particular then: 

a* (A) = A niinla, 7} + (1 — A) min{7, niaxjo; — /3, 0}} 
/3*(A) = Aniin{/3,max{Q! - 7,0}} + (1 - A)min{a,^} 

Notice also that, no matter how A G [0,1] is selected, the following quantity is constant: 

a*{X) + /3*(A) = min{/3 + 7, a} 

So, it is natural to wonder where the minimum for Ainax(CTd (q;'^(A), /3*(A))) is achieved. We establish 
now that the optimal choice is the one presented in Theorem [3] This is indeed natural, as it amoimts to 
maximize in the contribution of the diameter (i) of the smallest set Mk ■ 

It is well known that, in general, the spectral radius of a linear combination of matrices needs not 
be a convex function; see e.g. |131 p. 364]. The additional structure provided by this problem, however 
allows to exactly identify the optimal value of A as explained below. Consider the cone 

KTd = {w ■■ < Wi < W2 < W3 < . . . < WTa} , (13) 

which is contained in the nonnegative orthant. It is straightforward to verify that ^Tdio/* 1 P*)KTd ^ 
Kt^- In particular, then, by Perron-Frobenius theory [221 Theorem 3.1], we know that Kt^ contains an 
eigenvector, say w, relative to the dominant eigenvalue of (a"*, /?*). 

Moreover application of a classical characterization of the spectral radius Amax(CTi (0!"^, f3*)) [T71 The- 
orem 4], see also (TU], yields: 

An.ax(CT,(«*, r)) = max min flM^liEMl j e {1, . . . , T,}, + 

io>0 Wi 

The previous formula is established considering the nonnegative orthant (lo > 0). 

Now, the matrix CTd(c«*,/3*) is irreducible for the cone constituted by the nonnegative orthant (that 
is: irreducible in the usual sense). One deduces that it admits exactly one eigenvector in the interior of 
the positive orthant [22l Theorem 4.2]. The latter has to be w, and we thus have 

. f [CT.(«-,r)u>], \ . f [C^.(a^/3-^)H. .^.^ 
mm < : i ^ \ \, . . . ,lA> = max mm < : i G }1, . . . , Jdf, m; 

I J «'>0 1^ 

> „,ax min/M^Il^ : .G{l,...,r4, ^..^0 

> mm <^ : z G {1, . . . , Td} ^ , 

using the fact that w is also an element of K^^. In the previous series of inequalities, w > refers to 
the componentwise order relation, and the fact that Kt^ is included in the nonnegative orthant has been 
used. We thus deduce: 

A.„ax(CT.(a^r))= max min ( ^^^^ ^^^'^^^ : zg{1,...,T4, w, + (\ . (14) 
Now, the following expression holds for (q;'^(A), /3*(A)), 

CT,(a*(A), /3*(A)) = min{/3 + 7, «}le^.^ + (i\\)lT, + a*(A) Jr, • 

Hence, for all w G Kt^, the function CTd(Q;*(A),/?*(A))'u; is (cntrywise) monotone with respect to A. 
Henceforth, by (HI]), Aniax(CTi(a'^(A),/?*(A))) is also monotone with respect to A. In particular, the 
minimum is achieved when a* is maximum. In our notation this corresponds to A = 1. In conclusion, 
the optimal estimate is the one obtained in Theorem [31 □ 
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A. 3 Proof of Theorem 3] 

• Let the characteristic polynomial be xt{s) = dct{slT — (1 — a* — /?*) le^ — (3*It ~ a* Jt)- The matrix 
(1 — a* — ley + /3*/t + a* Jr being irreducible and nonnegative, its spectral radius is indeed a (real) 
eigenvalue; see [121 Theorem 8.4.4]. We thus aim to estimate the largest zero of Xt{s), that we denote 

Developing the determinant with respect to its first column yields, for any T > 2, 

Xris) = is~ nXT-i{s) - a*^~\l - ot - P*) . (15) 
It is then easy to check that 

Xt{s) = {s - P^f + {a^ + P* - 1) ((s - P*f~^ + a*{s - p*f-^ + • • • + a*^-\s - + a*^"') , 

and that xt[s) = if and only if ([7]) holds. In this relation, the coefficient "'""f is at most equal to 1. 
One sees directly that the scalar pT{a*, P*), the largest root of (I7|), is thus smaller than 1. 

• The fact that xi(s) = (s — 1 + a*) and formula (fT5|l permit to show recursively that, for any T >2, 

XT{l-a*), Xt{P*)<0. 

As 

lim Xris) = +oo 

S — > + CxD 

(in which the sign of the limit is crucial), one then deduces that, for any T > 1, 

PT{a\P*) >l-a\P* . 

• One deduces from ([TS]) that 

XT-i{s) = ^ xt{s) = -a*^-\l -a*-P*)<0, 
from which we conclude that, for T > 2, 

PT{a ,P*)> pT-i{a ,P*) . 

• The fact that pT{a* , P*) < a* + P* if and only if T < is obtained from the observation that 

^XT{a* + P*) = (T + 1) - = + l)T + a*) . 

a" a* a 

• Let us now determine the limit of px^a* , P*) when T — > +oo. As asserted by the inequality just proved, 
one may assume, for large enough T, that pt{q:* , P*) > a* + P* . In these conditions, one may multiply 

by _ 1^ both sides of Q taken in s = pT{a*,P*). This yields the identity: 

(pT{a\P*) - P*Y+' _ ^ ^ / f pTia*,P*)-P* Y ^ 



that is: ^ 

{pT{a\p*)-l)( Y^^''^'^P~^* ^ =(a*+r-l)>0. (16) 

The results in Theorem H] show that the factor {pT{a* , P*) — 1) is bounded when T +oo. On the 

other hand, the term a*'^~^ ) either tends to zero, or to infinity. In these circumstances, in 

order for the previous identity to be verified for T sufficiently large, the only possibility is that, when 

T +00, 

{PT{a , P ) - 1) ^ 0, I — I ^ +00 . 

Considering then that 

>T(a^/3*)-r^^ fi-p*'^ 



a* J \ a* 

identity (|16p yields the announced expansion formula. 
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A. 4 Proof of Theorem H] 

• As exposed in Remark O pT{ct*,P*) is the largest eigenvalue of (t{c(* , f3*), associated to a single 
eigenvector in M."^ , say w, contained in the cone Kt defined in (jl3p . 

By the very definition of this matrix, 

Ct(Q!*,/3^)w = pT{a* ,(i*)w ■ 
For any other pair (a**, /?**) and for any i e {2, . . . , T}, one has 

[C,T{a** , I3**)w\i = (1 - a** - [3**)[w]t + P**[w]i + a'*[w\i-i . 

Thus, 

[QT{a\pn^^ - [CT{a\nMr - {o^" + r - a* - r*)NT + (r* - ni^v + - a)[wv-i 

- - ni.[w]^ - Nt) + {a'* - a*){[wl., - \w\t) ■ 

Taken into account the fact that w € Kt-, which implicates — [w]t, [i&li-i ~ [w]t < 0, one deduces 
that: if < («*,/?*), then, for i = 2, . . . ,T<j, 

[CT(a'^^r'')^«]. > [CT(a*,/3'^)u;], = PT(a^r)N^ • 

Similarly, for 1=1, one has 

[CT(a**,/3**)^«]i = (1 - - /?**)[^i']T + /3**[u;]i , 

and 
so 

[CT(a*^/3^*)^&]l > [CT(a*,/3*)^]i =PtK,/3'^)[i&]i ■ 
To summarize, we thus have: 

CTK*,/3**)w>PT(a*,r)^& , 
and the fact that Ct(Q!'^*, /?**) has nonnegative components certainly implies [171 Theorem 4] that 

PT(a*,r) < A„,ax(CT(a'^*,r*)) = PtK*,/3**) ■ 

The nonincrcasingness of pT{oi* ^ (3*) with respect to {a*, 13*) is thus proved. 

• Careful examination of ^ shows that 

do^ da^ dl3^ d(3^ _d(3^ ^ da* _^ 

da ' d"f ' da ' 8(3 ' ' ' dp 

One thus shows easily, with the help of the previous result, that pxict, (3, 7) is nonincreasing in a and 
in p. 

The study of the influence of 7 requires to distinguish between three different cases. Based on ([S]), 
one has: 

• if a < 7, then a* = a, (3* = (3, so that prio:, (3, 7) is locally insensitive to variations of 7; 

• if a > /3 + 7, then a* = 7, /3* = /3; so that an increase of 7 induces an increase of a*, does not 
affect /3*, and therefore provokes a decrease of ^^(a,/?, 7). 

• in the intermediate case where 7 < a < /3 + 7, one has a* = 7, /?* = a — 7, so that the rationale 
developed in Remark [3] holds, indicating a decrease of pria, (3,"f) with respect to a*, i.e. with 
respect to 7. 
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• We now demonstrate the comparison relation stated in Theorem [5] for constant value of /3 + 7. When 
this quantity, together with a, is unchanged, the sum a* + f3* is conserved. Arguing one more time as 
in Remark[21 pxict* , (3*) is nonincreasing with respect to a* = min{a,7}, and thus nondecreasing with 
respect to f3. 

• Concerning the computation of pxia, f3,j) when a or 7 is null, it is straightforward when remarking 
that in both cases, a* = 0. 

Conversely, due to the monotonicity property already established, the largest value of pT{a,(3,j) is 
attained for extremal values. The nullity of /3 alone, which yields a* ~ min{Q!,7}, (3* = 0, does not 
induce that the maximal value is attained (see ([T])). Also, when a = but 7 7^ (resp. 7 = but a ^ 0), 
then = 1 (resp. = 1), and any increase of a (resp. 7) produces a decrease of pria, (3,j). 

• Last, when a = 1 and /3 + 7 = 1, then a* = 1 — (5, (3* = /3, and the matrix CT(a*,/3*) is upper 
triangular, with f3 on the diagonal. 

Reversely, decrease of a or /3 (resp. 7) implies increase with respect to 1 — 7 (resp. f3). 
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